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Abstract 

We consider a two-player game in which the first player (the Guesser) tries to guess, edge- 
by-edge, the path that second player (the Chooser) takes through a directed graph. At each 
step, the Guesser makes a wager as to the correctness of her guess, and receives a payoff 
proportional to her wager if she is correct. We derive optimal strategies for both players for 
various classes of graphs, and describe the Markov-chain dynamics of the game under optimal 
play. These results are applied to the infinite-duration Lying Oracle Game, in which the 
Guesser must use information provided by an unreliable Oracle to predict the outcome of a 
coin toss. 
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1 Introduction 

In this paper we study a two-player game in which the first player (the Guesser) tries to guess, 
edge-by-edge, the path that the second player (the Chooser) takes through a directed graph. 
At each step, the Guesser makes a wager as to the correctness of her guess, and receives a payoff 
proportional to her wager if she is correct. Optimal strategies for both players are derived for 
various classes of graphs, and the Markov-chain dynamics of the game are analyzed. 

The Path Guessing Game studied here is a generalization of the Lying Oracle Game [UEI]. 
In the Lying Oracle Game, an Oracle makes a sequence of n statements, at most k of which 
can be lies, and a Guesser makes bets on whether the Oracle's next statement will be a lie or 
not. We will see that the Lying Oracle Game is equivalent to our Path Guessing Game on a 
certain graph whose maximum out-degree is two. Ravikumar in [3] demonstrates a reciprocal 
relationship between the Lying Oracle problem and the continuous version of Ulam's Liar 
Game. In that game, a Questioner tries to find a subset of smallest measure that contains an 
unknown number in [0, 1] by asking a Responder n questions about the number's location in 
the interval. Again the Responder may lie up to k times. Under optimal play, the measure of 
the Questioner's subset is the reciprocal of the Bettor's fortune in the Lying Oracle Game. In 
4 Rivest et al. use this game to analyze binary search in the presence of errors. 

In addition to its intrinsic interest, the Path Guessing Game provides a context in which 
new questions about the Lying Oracle Game can be asked and answered. For instance, what 
are the optimal strategies for the infinite-duration Lying Oracle Game, in which no block of 
n statements can contain more than k lies? Questions of this sort are taken up in the last 
section of this paper, after a general analysis of the Path Guessing Game has been carried out. 



1.1 Description of the Game 

To describe the Path Guessing Game precisely, let Q — (V, £ ) be a directed graph with vertex 
set V and edge set £. Call a vertex j £ V a terminal node if it has out-degree zero, and assume 
that each terminal node j has been assigned a positive value Vj. Both players know these 
values. The Guesser and the Chooser are initially located at the same non-terminal node i of 
Q, and the Guesser has an initial fortune of one dollar. From the current node, the Chooser is 
going to select one of the adjacent nodes to move to next. But before he does so, the Guesser 
writes down a prediction as to which node the Chooser will select. In addition, the Guesser 



makes a wager with the Chooser as to the correctness of her guess. (At this point the Guesser's 
prediction is unknown to the Chooser, but the Chooser does know the amount of the wager.) 
Next, the Chooser makes his selection of the node j to move to next. If the Guesser was 
correct in her guess, then she receives a payoff proportional to the amount of her wager, while 
if she was incorrect, she loses the amount of her wager. The payoff rule is explained in more 
detail below. After the payoff both players move to the node j selected by the Chooser. If 
node j is a terminal node, then the Guesser's fortune is multiplied by the corresponding value 
Vj , and the game is over. If node j is non-terminal, play repeats from there. The goal of the 
Guesser is to maximize her expected fortune, while the goal of the Chooser is to minimize the 
Guesser's expected fortune. We are interested in the optimal strategies for the players in this 
game, and the dynamics of play under the optimal strategies. 

We have adopted a payoff rule which rewards the Guesser for risk-taking. To motivate this 
rule, consider first the situation in which the players are at a non-terminal node i of out-degree 
rii > 2. Then "on average" the Guesser will lose her wager m — 1 times for every time that she 
wins. This being the case, fairness suggests that her payoff on winning should be n» — 1 times 
her loss on losing. This odds-weighted payoff gives the Guesser incentive to make non-zero 
wagers at nodes of high out-degree. Next, consider the case in which the players are at a 
non-terminal node i of out-degree n» = 1. The Guesser's prediction will of course be correct in 
such a case, but the odds- weighted payoff in this case would be zero. To encourage the Guesser 
to wager, we choose to suspend odds-weighting in the n» = 1 case, and allow the Guesser to 
take advantage of a sure bet. Thus our payoff rule is 



new fortune 



current fortune + (m — 1) ■ wager if m > 2 and Guesser is correct, 
current fortune + wager if m = 1 and Guesser is correct, (1) 

current fortune — wager if Guesser is incorrect. 



Note that if the out-degree of every vertex is two or less, then the payoff rule is simply: the 
Guesser wins the amount of her wager if her guess is correct, and loses it otherwise. 

In addition to the payoff rule, optimal strategies obviously depend on the type of graph in 
which the game is being played. Call a graph terminating if every node has a path to some 
terminal node. For terminating graphs, we will show that the values of the non-terminal nodes 
can be derived from the pre-assigned values of the terminal nodes, and that optimal strategies 
can be derived from these values. We will also consider the strongly connected case, which is 
interesting because there are no terminal nodes, no pre-assigned values, and the game duration 
is infinite. 

Most of our notation is standard. Random variables and matrices are denoted with upper- 
case letters; constants and vectors are lowercase. We will write i — > j to indicate that there 
is a directed edge from node i to node j in the graph Q. The out-degree of vertex i will be 
denoted by n or t»i. The symbol 1 will indicate either a vector or matrix, all of whose entries 
are 1. The dimensions should be clear from the contex. For readability, the Chooser will be 
consistently referred to as "he" , and the Guesser as "she" . 



2 The Game on Fans 

Let Q be a fan with n leaves (i.e. a tree of height 1 on n + 1 nodes) . The players are initially 
located at the root node of Q, and each leaf has a positive value Vj assigned to it. We begin 
by delineating the strategy sets for each player. 

Let E c denote the set of pure strategies available to the Chooser. The Chooser's play 
consists of selecting the destination node, so 

S c = {j : 1 < j < n}. 

On the other hand the Guesser must choose both a wager and a guess as to the destination 
node. Thus her strategy set E 9 is 

E 9 = {(j,w) : 1 < j < n and w £ [0, 1]}. 

Both players are allowed to play non-deterministically. Let C denote the (generally random) 
destination node selected by the Chooser, and similarly let G and W denote the Guesser's 
guess and wager respectively. Naturally we assume that the Chooser's choice C and Guesser's 
guess G are independent: neither the Chooser nor the Guesser are mind readers. However, the 



wager is publicly announced, so we leave open the possibility that the Chooser's choice C is 
dependent on the Guesser's wager W. Thus, the Chooser's strategy is completely determined 
by the conditional probabilities 

p(j\w) = Prob (C = j\W = w) . 

The Guesser's strategy is specified by the joint distribution of G and W. This is determined 
by the conditional distribution of G given W, 

q(j\w) = Prob (G = j\W = w) , 

along with the unconditional distribution of W . 

Let F denote the Guesser's fortune at the end of the game. The Chooser's goal is to 
minimize E [F] , while the Guesser wants to maximize E [F] . 

Theorem 1 (Optimal play in the game on fans). Let Q be a fan with n leaves, each of which 
has been assigned a positive value Vj . The optimal strategy for the Chooser in the Path Guessing 
Game on Q is to make his choice independently of the Guesser's actions, with 

fl ifn = l 

p(j) = Prob(C = j) = { „-i (2) 
^„ J -i ifn>2 

The optimal strategy for the Guesser is characterized by 

W={ 1 ifn = 1 (3) 

{ 1 - nf3pm in ifn>2 

and 

{1 if n — 1 

npO)-l + » L >2 W 

nw — 

where p m i n = min{p(j) : 1 < j < n} and (3 is any random variable with support in [0, 1] . Under 
optimal play, the Guesser's expected fortune at the end of the game is given by 



E[F] 



2vj if n — 1 

" —i tfn>2 



Proof. The n — 1 case is trivial, so assume n > 2. Let H denote the harmonic mean of the 
values 

H= " _ t . (5) 

We first show that no matter what strategy the Chooser employs, the Guesser can always 
find a strategy such that E [F] > H. Towards that end, consider any Chooser strategy 
(p(k\w), k € V). Then for all j £ V and w G [0, 1] we have 

E [F\G = j, W = w] = (1 + (n - l)w) v jP (j\w) + {l-w)J2 v k p(k\w) 

n 

= nwvjp(j\w) + (1 — w) Vkp(k\w) 
k=l 

= wnvjp(j\w) + (1 — w) v 
= w (nvjp(j\w) — v) + v 

where to ease notation v = 5Zfc=i Vkp{k\w). Define the critical wager 

w c = l- Hv m i x , (6) 

where ij m « is the maximum value of the nodes in Q. Fix any w G [w c , 1], and choose j — j(w) 
such that 

VjP{j\w) > v k p(k\w) (7) 
for all k G V. We claim that for this j and w we have 

E [F\G =j,W = w]> H, (8) 



with equality if and only if 



p(k\w) = k (9) 
2Zt=i v e 



for all k, in which case we actually have 

E[F\G = k,W = w]=H (10) 

for all k. To prove the claim we need to show that 

(1 - pHv^) (nv jP (j\w) - v) + v > H (11) 

for all [3 € [0, 1]. But since j satisfies ©, we have nvjp(j\w) — v > 0. Thus, it is necessary 
and sufficient to prove (|lip for (3 — 1: 

(1 - Hv~L) {nv jP (j\w) -v)+v>H. (12) 

Let S = v k 1 j so that SH = n. Multiplying both sides of (|12p by Su max and rearranging, 

we see that (fTT) is equivalent to 

p(j\w)vj (SVmax - Tl) > (13) 

To establish (|13[L note first that 

"max 



Sv max . Tl — ^ ^ 

Therefore 



p{j\w)vj (Sv max - ra) = p{j\w)vj — — 

— ^fe 

fc=l 



y^p(fc|w)- 



n 

"max ^fc 

) (W) 

where we have used ([7J for the inequality. Therefore we have (|13p . and hence ©. Moreover, 
there can be equality in (|14|l - and hence in (JHJ) - if and only if 

p(k\w)v k = p(j\w)vj 

for all k, which immediately implies (|9}. It is straightforward that ((9]) implies (|10[) , so we 
have the claim. Thus no matter what the Chooser's strategy is, the Guesser can always find a 
strategy such that E [F] > H . Moreover, the claim shows that if the Chooser uses any strategy 
other than ((9]), then Guesser can find a strategy such that the inequality is strict. 

Now we show that no matter what strategy the Guesser uses, the Chooser can always find 
a strategy that forces E [F] < H. Towards that end, note that 

E [F\C = j, W = w] = (1 + (n - 1)to) q(j\w)vj + (1 - q(J\w)) Vj 

= Vj (w(nq(j\w)-l) + l), (15) 

We claim that the expectation in (|15[) is equal to H for all j if and only if 

w = 1 - f3Hv~l x (16) 

and 

q{jH = np(j)-l +W ^ 
nw 

where p(j),j € V are given by ((2]), and f3 G [0, 1] is arbitrary. To see this, note that HvJ 1 = 
np(j), so the expectation in (|15p is equal to H for all j if and only if 

np(j) — 1 = w (nq(j\w) — 1) for all j. 



Clearly there is a one-parameter family of solutions to this diagonal system, and direct sub- 
stitution verifies that (|16|) and (|17|) constitute the solution set. We further claim that for any 
other choice of strategy by the Guesser, the Chooser can find a strategy p which makes the 
Guesser's expected fortune strictly less than H. To see this, note that (|15|l implies that 



E 

3=1 



E[F\C = j,W = w] 



for any choice of w and q(j\w),j G V. We already know that the expected values in the sum 
are all equal to H if and only if the Guesser's strategy is given by (fl6|) and (fl7)l . Thus, for any 
other choice of the strategy, some of the expected values in the sum will have to be greater than 
H , and more to the point, some will have to be less than H (so that the value of the sum is 
still n). Therefore by concentrating his strategy on the j for which E [F\C — j, W = w] < H , 
the Chooser can force E [F] < H , as claimed. This completes the proof. □ 

A few comments before moving on. By allowing the players to begin play at any node of 
the fan Q, we can think of the pre-assigned values of the leaf nodes as conditional expectations: 

Vj = E [F|Gucsser starts at node j with $1] . 

Theorem [T] then gives the value of the root node i: 



E [F | Guesser starts at node i with $1] 



if n = l 
if n > 2 



Thus we see that the pre-assigned values of the leaf nodes propagate up the fan to the root 
node. This observation will be used repeatedly in the sequel. 

The Chooser's optimal strategy is unique, but Theorem [1] shows that there is a family of 
optimal strategies for the Guesser: the parameter /3 in (|19[) can have any probability distri- 
bution on [0, 1] whatsoever. Theorem [T] would be simpler if we required that the wager W be 
deterministic. Then the optimal strategies for the Guesser would form a one-parameter family, 
given by 

g(j)=Prob(G = j) = y - ffP-i" (18) 

and 

w = 1 - n/3p min (19) 

The parameter /3 G [0, 1] would then parameterize the Guesser's optimal strategies, and in 
each one, the wager would be deterministic. In the interest of simplicity, we will take the 
wager to be deterministic throughout the rest of the paper. Accordingly, we drop the random 
variable notation W, and simply denote the wager by w. 

Note that the critical wager © corresponds to /3 = 1 in (|19p . The corresponding proba- 
bilities for the Guesser are 

P(j) - Pn 



-'mm 



We will refer to this as the minimum risk optimal strategy for the Guesser. The maximum risk 
optimal strategy for the Guesser corresponds to /3 — in equations (|18p and (|19p . in which case 
the wager is w = 1 and the Guesser's probabilities are equal to the Chooser's. Although all the 
Guesser's strategies are optimal in the sense of maximizing her expected fortune, the maximum 
risk strategy carries with it the distinct possibility that the Guesser will be bankrupt at the end 
of the game. From a practical point of view, the minimum risk strategy is clearly preferable. 
Interestingly, in the n = 2 case, if the Guesser follows her minimum risk optimal strategy, then 
her fortune is guaranteed to equal the harmonic mean of the values: F — E [F] — H with 
probability one. The proof is left to the interested reader. 



3 The Game on Trees 

Let Q be a tree. The Path Guessing Game on Q is a straightforward extension of the game 
on fans. Again, the terminal nodes of Q are the leaves, which we assume have been assigned 
positive values Vj. The players are initially located at the root. Note that if play has progressed 
to the point where the players are at a node i whose children are all leaves, then at that point, 
they are playing the game on a fan. The optimal strategies are given by Theorem [TJ the only 



difference being that the Guesser's fortune is some number F rather than one dollar. But we 
can account for this by thinking of the wager w 6 [0, 1] as the proportion of the Guesser's 
fortune that is wagered. If the out-degree of node i is at least 2, then by Theorem Q] the 
Guesser's expected fortune at the end of the game, given that her fortune at node i is F, is 
HiF, where Hi is the harmonic mean of the values of the leaves of node i. Thus we should 
assign the value of node i to be Vi = Hi in this case. In the same way, if the out-degree of % 
is 1, then Theorem [1] implies Vi = 2ve, where I is the single child of i. Likewise, values can be 
assigned to all the interior nodes of the tree, all the way up to the root. Playing in accordance 
with these values is optimal, by a straightforward induction on the height of the tree. These 
ideas are summarized in the following theorem. 

Theorem 2 (Optimal play on trees). Let Q be a finite rooted tree in which each leaf node £ 
has been assigned a positive value vi . Define the values of all the other nodes i of Q by 

vJ 1 /2 ifm = \ and i -> j 

-1^ -1 f ^r, ( 20 ) 

The optimal strategy for the Chooser is given by 

if ni = 1 and i — » j 

if ni > 2 and i — * j (21) 
otherwise 

There is a one-parameter family of optimal strategies for the Guesser, given by 

1 if ni — 1 and i — > j 

(Pi,i - Ppi,min) I (1 - nifipij) ifm > 2 and i — > j (22) 
otherwise 




and 



1 if ni = 1 

1 — ni/3pi, min ifm > 2 



(23) 



where pi, m m = ram{pi^ : j such that i —> j}, and /3 is arbitrary in [0, 1]. Under optimal play 
the Guesser's fortune F at the end of the game satisfies E [F] = vq, where vo is the value of 
the root node of the tree. 



4 The Game on Terminating Graphs 

A connected digraph in which every non-terminal node has a (directed) path to some terminal 
node will be referred to as a terminating graph. In this section we assume that Q is a termi- 
nating graph with N vertices, in which each terminal node j has been pre-assigned a positive 
value Vj . Note that since a terminating graph may contain cycles and other strongly connected 
subgraphs, the duration of play is potentially infinite. We will derive optimal play in the Path 
Guessing Game on Q by first assigning appropriate values to the non-terminal nodes, then 
proving that playing the game in accordance with these values (as in Theorem [1} is optimal. 

We begin by considering a truncated game which lasts for at most s steps. The truncated 
game is identical to the non-truncated game, except that if the players have not reached 
a terminal node by step s, the game is stopped. We will refer to this as the s-step game 
(even though it may be over in fewer than s steps). The s-step game can be completely 
described by iV NT path trees, where AW is the number of non-terminal nodes of Q. The path 
tree corresponding to a non-terminal vertex i represents all the paths emanating from i whose 
lengths are at most s. Terminal nodes in the original graph Q will be leaf nodes in the path 
trees. The non-terminal nodes that are reachable in s steps from node i will also appear as leaf 
nodes in the path tree corresponding to i. For the path trees in this truncated game, terminal 
nodes retain the values assigned to them in the non-truncated game. Leaf nodes of the path 
trees that correspond to non-terminal nodes are assigned the value of 1, in keeping with the 
idea that the game simply stops if one of these nodes is reached. Thus we have reduced the 
s-step game to a game on the path trees of length s, which can be solved by propagating the 
values up the path trees using Theorem [5] 

The propagation of the values for the s-step game can be summarized nicely using a matrix 
approach. Let AT NT be the number of non-terminal nodes, and let 7V T be the number of terminal 



nodes. Number the nodes so that the non-terminal nodes are numbered first, followed by the 
terminal nodes. Consider first the path trees for a 1-step game on Q. Let uo denote the vector 
of the reciprocal values of the leaf nodes. In keeping with our numbering convention, uo is a 
partitioned vector of the form 

u = (-L-) , (24) 

where 1 is the jV NT x 1 vector of ones, and u T is the JV T x 1 vector containing the pre-assigned 
reciprocal values of the terminal nodes of Q. By Theorem [1] the reciprocal values of the 7V NT 
root nodes of the path trees for the 1-step game are given by 

Uj = Muq, 

where M is the N x N propagation matrix M = (rm,j : i,j € V) defined by 

1 if i is a terminal node and i — j 

1/2 if m = 1 and i —> j 

1/m if m > 2 and i — > j 

, otherwise 



(25) 



Inductively, the reciprocal values for the root nodes of the path trees for the s-step game are 
given by 

u s = M s u . (26) 

Our first goal is to prove that lim^oo u a exists. In accordance with our numbering convention, 
the partitioned form of the propagation matrix M is 



M = 



' A 


B ' 





I 



(27) 



where A is N ST X iV NT , B is N NT X N T , the zero submatrix is jV T x -/V NT , and the identity submatrix 
is N T x N T - It follows that the powers of M are of the form 



M a = 



.4 s 







(28) 



The next lemma establishes that limiting values exist, are strictly positive, and that the limiting 
reciprocal value vector it satisfies Mu — u. 

Lemma 1 (Limiting Values). Let Q be a terminating graph, and let M be the associated 
propagation matrix defined by (|25[) . Then lim^oo M s exists, with 



lim M B 











(I — A) _1 B 



(29) 



where A and B are the submatrices defined by (|27[) . In particular, the limiting reciprocal value 
vector u = lim^oo M s uq exists, satisfies Mu — u, and is strictly positive. Partitioning the 
limiting reciprocal value vector consistently with the partition of M , 



(30) 



fir 



the limiting reciprocal values w NT of the non-terminal nodes are related to the values it T of the 
terminal nodes by 

u NT = {I -A)' 1 B Ui: , (31) 

Proof. Note that if all the eigenvalues of A were less than 1 in absolute value, we could 
immediately conclude that lim^oo X^i=o ^ ~ ~ an d the result would follow. We 

claim that in fact all the eigenvalues of A are strictly less than 1 in absolute value. To see 
this, first observe that A is a nonnegative sub-stochastic matrix, so by the standard Perron- 
Frobenius theory of nonnegative matrices (see [5], for instance), the maximal eigenvalue r of 
A is nonnegative and less than or equal to 1, and |A| < r for any other eigenvalue A of A. My 
claim is that r < 1. Suppose (by way of contradiction) that r = 1. Then there is an associated 
nonnegative eigenvector x, which we may assume without loss of generality has been scaled so 
that its maximum component is equal to 1. Now the equation Ax — x along with the definition 
(1251) imply the following: 



1. If the non-terminal node i of Q has out-degree 1 (i.e. rii — 1), and if the (sole) child of i 
is a terminal node j, then Xi = 0. 

2. If the non-terminal node i has m = 1, and if its (sole) child is a non-terminal node j, 
then Xi — Xj /2. 

3. If the non-terminal node i has rn > 2, then 

Xi = n~ 1 x h (32) 

jei(i) 

where J(i) = {non-terminal nodes j such that m%j 7^ 0}. 

Items [T] and [2] imply that entries of x corresponding to non-terminal nodes of out-degree 1 are 
all strictly less than 1. Now consider the remaining entries of x, which correspond to non- 
terminal nodes i of out-degree 2 or more. Let di denote the distance from non-terminal node 
i to the nearest terminal node. Item[T]says that if di = 1, then xt — 0. Now suppose di — 2. 
Then there is a directed edge from node i to some node j with dj — 1. Thus Xj < 1, and 
therefore Xi < 1 by (|32[) (since the cardinality of J(i) is at most n»). So any non-terminal node 
i with di — 2 must have Xi < 1. An easy induction shows that any non-terminal node i that 
is at a finite distance from some terminal node must have Xi < 1. But in a terminating graph 
every non-terminal node is at a finite distance from some terminal node. Therefore Xi < 1 for 
all non-terminal nodes i, contradicting our scaling of x so that its maximum component was 
1. Thus r < 1 as claimed, and we have equation (1291 1, 

From (|26p and (|29|) it follows that the limiting reciprocal value u — lim^oo u s exists, and 
satisfies 











(/- A)~ L B 



u Q . (33) 



Equation (|31|l follows immediately. Using (|31[) and (|27|l it is easy to see that the limiting vector 
it is a right eigenvector of the propagation matrix, corresponding to the maximal eigenvalue 
r = 1. 

It remains to show that u > 0. We claim that no row of (I — A)^ 1 B consists entirely of 
zeros. To see this, note that each non-terminal node i is connected to some terminal node k 
by a path in Q of some length s > 0. The node j immediately preceeding k in this path must 
be a non-terminal node. It follows that the i, j'-entry in A"~ l is nonzero, as is the j, fc-entry in 
B. Therefore row i of A S ~ 1 B can not consist entirely of zeros. Since (7- A)' 1 B is a sum of 
such (nonnegative) terms, its i th row can not consist entirely of zeros either, establishing the 



claim. Now the pre-assigned reciprocal value vector u T is positive by assumption, so by (|3ip 
we conclude that u NT is strictly positive. This completes the proof of the lemma. □ 

Now we turn to the full, non-truncated game. The previous lemma shows that for a 
terminating graph, any assignment of values for the terminal nodes leads to a unique set of 
limiting values for the non-terminal nodes. A reasonable option for the players in the non- 
truncated game is to play in accordance with these limiting values. For the Chooser, this 
would mean 

p lJ =Prob(A t+1 =j|X t = l ) = ( 1 -! n ^o' ( 34 ) 

where X t denotes the vertex occupied by the players at time t. For the Guesser, it would mean 
qt.j ;=Prob(G t+ i =j\X t =i) = \ 1 ifn s = l ^ 

{ (Pi,j — /?P»,min) / (1 — riif3pi tI nin) if Ui > 2 

where Gt+i is the Guesser's prediction of where the Chooser will go on the next step, with the 
wager at node i given by 

1 ifni = 1 , (36) 

1 - rn/3pi,inixx if m > 2 

where (3 is arbitrary in (0, 1], and Pi, m in is the probability of the vertex least likely to be chosen 
by the Chooser: 

Pi.min = min{pi,j : j 6 V, i — » j} 



We will refer to the strategies given by (|34[) . (|35|l . and (|36|) as the limiting strategies for the 
game. 

Let P = (pij : i,j £ V) denote the transition probability matrix corresponding to the 
Chooser's limiting strategy (|34p . These are the transition probabilities for the Markov chain 
that describes the players' position in the graph as a function of time. Strictly speaking, 
these probabilities only make sense if i is a non-terminal node. However, we can think of each 
terminal node as having an attached loop leading back to itself, with the understanding that 
all wagering and guessing stops as soon as the players reach a terminal node. Then pij — 1 if i 
is a terminal node, which makes the Markov chain Xt well-defined for all t. The end of actual 
play in the game is given by the hitting time T of the chain to the set of terminal nodes: 

T = min{t > : X t is a terminal node}. 

The next theorem gives the basic properties of the game under the limiting strategies. 

Theorem 3 (Limiting strategies). Let Q = (V, £) be a terminating graph in which each 
terminal node has been assigned a positive value. Let M be the propagation matrix for Q 
defined by (|25f) . and let v — (vi : i £ V) be the vector of limiting values. Then under the 
limiting strategies the Chooser's transition probability matrix P is diagonally similar to the 
propagation matrix: 

P = VMV~\ (37) 

where V — diag(v) is the diagonal matrix of limiting values. Under the limiting strategies, the 
game duration T is finite with probability 1, and the Guesser's expected fortune at the end of 
the game is equal to the value of the node at which play began: 

E[F T \X =i]=Vi. (38) 

Proof. Because the vector it of limiting reciprocal values satisfies Mu = u, equation (|34p 
reduces to 

Pi,j — vtii.j Uj/ui — mij Vi/vj 

for all i and j, and (|37|) follows. Because the limiting reciprocal value vector u is strictly 
positive, it follows from (|34|l that pi.j > whenever i — > j in Q. Therefore, the hitting time T 
is finite with probability 1 under the limiting strategies. 

For (|38|) . note first that it is obviously true if j is a terminal node. If i is non-terminal, 
then conditioning on the position of the players at time t — 1 gives 

E [F T \X = i] = [ 2E = A if * = (39 ) 

\j: r .^P^E[F T \X = i,X 1 = j] ifn>>2 

Now in the second case we have 

E [F T \X =i,Xx = j] = [q itj (1 + (m - l)v>i) + (1 - q itj ) (1 - Wi)] E [F T \X = j] 
= {wi -1) + 1)E [F T \X = j] 
= mpijE \Ft\Xq = j] 

= -E[F T \X =j], 

Vj 

so that (|39p becomes 

[2E [F T \X = j] if m = 1 and i ~* j 

E{F T \X = i]={ "Jprp.v , ., ^„ • (40) 

)>l, ., rrnj^E [F T \X = j] if n t > 2 

This is recognized as the matrix equation 

C = V 2 MV~ 2 C, 

where — E [Ft\Xo =i\- So the vector z = V~ 2 C, satisfies Mz — z, and using the partition 
in J27j> we get 

Az NT + Bz T = z NT , (41) 



where we have partitioned z consistent with the partition of M in (|27[) . We know that for 
terminal nodes i we have = Vi, and it follows that z T = V^ _2 £t = u T . Therefore the 
solution Zst of (|41l) is 2 NT = (L — A)" 1 Bu T , which equals u KT by (|31|) . Thus z = u, and 
£ = V 2 z = v. This completes the proof. □ 



Other quantities of interest in the game include the terminal probabilities 



p it k = Prob (X T = k\X = i) , 

the expected stopping times 

n = E [T\X = i] , 
and the conditional distributions of the stopping time 

q Mi = Prob (T = t\X = i) . 

Form the matrix 

P = (Pi,k '■ i is non-terminal, j is terminal) , 

and the vectors 

It = (?t|i : i is non-terminal) 

and 

r = (-Tj : i is non-terminal) . 

(In each case the values when i is terminal or j is non-terminal are obvious.) The next theorem 
expresses these quantities in terms of the propagation matrix and the limiting values. In the 
statement of the theorem, V NT = diag(u NT ) and V T = diag(w T ) are the diagonal matrices of 
limiting non-terminal and terminal values respectively. 

Theorem 4. Let Q be a terminating graph, and suppose the players use the limiting strategies 
defined by (|34[) , (|35p , and (|36p . Then the vector r of expected stopping times is given by 

r = V m (I — Ay 1 Vnt 1 !. 

The conditional stopping time distributions are given by 

qt = W'^'w; 1 !, 

and the terminal probabilities are 

p = V m {I-A)~ 1 BV- 1 . 

Proof. The proof proceeds by conditioning on the player's position at time t = 1. Details are 
left to the reader. □ 

As might be expected, the limiting strategies are in fact optimal for the game on terminating 
graphs. 

Theorem 5 (Optimal play on terminating graphs). Let Q be a terminating graph in which 
each terminal node has been assigned a positive value. Then the limiting strategies (|34l) . (I35|) . 
and (|36[) are optimal. 

Proof. It is clear that we can restrict our attention to strategies that are purely positional, in 
the sense that at every time the players are at a given vertex i, they play the same strategy. 
We will continue to use E to denote expected values under the limiting strategies, while E* 
will denote expected values under general (but fixed) purely positional strategies. Suppose 
both players are playing a general homogeneous strategy, possibly different from the limiting 
strategies defined by (|34[) . (|35|l . and (|36[) . Define v*(i) by 

v*(i) = limsupS* [F t \X = i] ■ (42) 

t — »oo 

Note that for terminal vertices i, v*(i) equals the pre-assigned value Vi. We first claim that if 
the players are playing optimally, then < v*(i) < oo for all non-terminal vertices i. Indeed, 
v*(i) > because the Guesser can always elect to bet zero at every vertex, while v*(i) < oo 
because the Chooser can always elect to take the shortest route from vertex i to a terminal 
vertex. 

Next we claim that if the Chooser is playing optimally, then T, the hitting time to the set of 
terminal states, must be finite with probability 1. To see this, suppose by way of contradiction 
that P*(T = oo) > 0. Then there must exist a non-terminal vertex i that is part of a strongly 
connected subgraph Q* of Q that can be visited infinitely often by the players. Without loss of 
generality there is an edge from i to a vertex j not in Q* that leads to some terminal node z, 
and such that the Chooser's probability of selecting the edge from i to j is zero. (Otherwise, T 



would be finite with probability 1.) The out-degree rii of i is therefore at least 2. Consider the 
following strategy for the Guesser: when at vertex i, the Guesser bets one half of her fortune 
on the node k that the Chooser is most likely to select; when at any other vertex in Q* , the 
Guesser bets nothing. Because the Chooser goes from i to j with probability 0, the probability 
p* k that he goes to the node k must satisfy 



Pi,k 



> 



Therefore the Guesser's expected fortune after playing at vertex i (as a proportion of her 
current fortune) are 

| (1 - Pi,k) + f 1 + 7j (n* - 1)^ Pi,k = \ {Ptk-ni + 1) 

>I(_!!*_ + l 

- 2 \rH-l 
> 1. 

Thus the Guesser increases her fortune on average each time the players are at vertex i. Since 
there is a positive probability this will happen infinitely often, v*(i) as defined by (|42p will 
be infinite. But this contradicts the claim proven above that v*(i) is finite if the Chooser is 
playing optimally. Therefore the claim that P*(T — oo) = is established. 

Now T is finite with probability 1, it follows that v*(i) as defined by (|42p satisfies 

v*(i) = E* [Ft\X = i] . 

provided that the players are playing optimally. Therefore, given that play starts at node 
i, the game on Q is equivalent to the game on the fan whose leaves have the values v*(j), 
where i — » j. Optimal play in this game is given by Theorem [T] Since this is true for each 
vertex i in Q, the reciprocal value vector u* = (u*(i) : i G V, u*(i) = must satisfy 

Mu* = u* , where M is the propagation matrix (I25p for Q. But u* must agree with u, the 
reciprocal value vector for the limiting strategy, on the terminal nodes, since the values of 
those are pre-assigned. It follows now from the proof of Lemma[T]that the non-terminal values 
u*, T must satisfy 

u nt — (I ~ Bu T , 

and therefore u* — u. This completes the proof. □ 

For what terminating graphs Q is the Path Guessing Game fair, in the sense that the 
Guesser's expected fortune at the end of the game is equal to the $1 that she started out with? 

Theorem 6. Let Q be a terminating graph. Then the Path Guessing Game on Q is fair if and 
only if each terminal node has a value of 1 and each non-terminal node has out-degree at least 
2. 

Proof. The game will be fair if and only if the value of every node is 1. From equation (|3ip 
this is equivalent to 1 = (I — A)" 1 _B1, and it follows from this that for each non-terminal 
i, the i th row sum of A plus the i th row sum of B must equal 1. By the construction of the 
propagation matrix, this can happen if and only if Q has no non-terminal node of out-degree 
one. □ 



5 Strongly Connected Graphs and the Discounted 
Game 

Let Q — (V, S) be a strongly connected digraph on N vertices. The rules for the Path Guessing 
Game on Q are identical to those for a terminating graph, the only difference being that there 
are no terminal nodes in a strongly connected graph. As in the previous section, we will derive 
optimal play on Q by considering the truncated game on Q obtained by the stopping the Path 
Guessing Game after s steps. 

We are interested in the values of the root nodes of the path trees for the s-step game, as 
s goes to infinity. Consider first the one-step game on Q. Equation (J20J of Theorem [2] holds 



for all vertices i, where the values of all the leaf nodes are 1. This can be summarized by the 
matrix equation 

Mi = Ml, 

where u\ is the vector of reciprocal values, 1 is the vector of all ones, and M = (m^.j : i,j 6 V) 
is the N x N propagation matrix given by 

{1/2 if m = 1 and i — > j, 
1/m if m > 2 and i — > j, (43) 
otherwise, 

where n; is the out-degree of vertex i. Inductively, reciprocal values for the root nodes of the 
path trees for the s-step game are given by 

u s = M'l. 

In contrast to the case of terminating graphs, limiting values for the s-step game may 
be infinite. For instance, if Q is a 2-cycle, then v s (i) — 2 s for i = 1,2. However, recall from 
Theorem[T]that optimal strategies depend only on the ratios of values. We will be able to show 
that limiting ratios continue exist in the strongly connected case, and that the corresponding 
strategies are optimal. 

Towards this end, consider the following discounted game on the strongly connected graph 
Q: play proceeds just as before, but after each payoff, the Guesser's fortune is multiplied by a 
fixed discount factor d £ (0, 1]. Intuitively, one can think of the discounted game as modeling 
a situation in which the real value of money is decreasing with time, as in an inflationary 
economy. (In this case, the reciprocal of the discount rate would be the inflation rate.) It is 
clear that optimal strategies for the discounted game are the same as for the undiscounted 
game. 

For the discounted 1-step game, the values of all the leaf nodes are d. Reciprocal values 
are therefore 1/d, and so the reciprocal values of the root nodes in the discounted game are 

Ux = Af (d _1 l) =d~ 1 Mt. 

Inductively, values for the root nodes of the path trees for the discounted s-step game are 

u B = d~ s Ml, (44) 

and the idea is to determine the value of the discount factor d that makes the limiting reciprocal 
values finite and nonzero. 

It turns out that the correct choice is to make the discount factor equal to the largest 
positive eigenvalue of the propagation matrix M . Before proving this, we will make one more 
assumption on the graph Q. In addition to being strongly connected, we will assume that Q 
is aperiodic. For a strongly connected graph, aperiodicity means that the greatest common 
divisor of the lengths of all the cycles in Q is one. (Note that any strongly connected graph 
that contains a loop is aperiodic.) Aperiodicity rules out certain cyclic phenomena that, while 
not unduly hard to characterize, serve mainly to cloud the important issues. The next lemma 
collects the properties of the propagation matrix M that we will need. 

Lemma 2. Let Q be a strongly connected aperiodic digraph, with associated propagation matrix 
M given by ()43|) . Then we have the following: 

1. M has a positive maximal eigenvalue r, with the property that any other eigenvalue A of 
M satisfies |A| < r. 

2. There are strictly positive right and left eigenvectors x and y respectively associated with 
the maximal eigenvalue r. 

3. The right and left eigenspaces of M associated with r ( and containing x and y respectively) 
each have dimension 1. 

4- No other eigenvector of M is positive. 

5. The maximal eigenvalue r satisfies ^ < r < 1. 

Proof. Since Q is strongly connected and aperiodic, it follows that M is irreducible and primi- 
tive, and properties [T] through U follow from the standard Perron- Frobenius theory of nonneg- 
ative matrices (see [5], for instance). Property [5] follows from the fact that all the row sums of 
M are between one half and one. □ 



We are now in a position to prove that limiting values exist for the discounted game. 

Lemma 3 (Limiting Values for the Discounted Game) . Let Q be a strongly connected aperiodic 
digraph, with associated propagation matrix M given by (I43p . Let r be the maximal eigenvalue 
of M , with associated positive right and left eigenvectors x and y respectively. Then we have 

lim r~ s M s = (45) 
s— >oo x y 

where the limit is a strictly positive matrix. Moreover, the limiting reciprocal value vector u 
for the discounted game (with discount factor d = r) given by 

T 

u = lim r- s M a l = ^-1 (46) 
s— >oo x y 

is a positive right eigenvector of M corresponding to r: 

Mu = ru (47) 

Proof. Equation (|45|l is easily derived from Lemma [5] and the Jordan canonical form for M 
(see [6], for instance). Equation (|46|l follows directly from (|44[) and (I45[) . Finally, 

Mu = M lim r~ s M s t 

s — >oo 

= r lim r~ {s+1) M s+1 t 

s — >oo 



and u is positive because both x and y are. □ 

Lemma [3] shows that setting the discount factor d equal to the maximal eigenvalue r of the 
propagation matrix results in a discounted game in which the limiting values are all finite and 
nonzero. From this point onwards, when we refer to the "discounted game", it is understood 
that the discount factor is the maximal eigenvalue of M. Also, for ease of notation we drop 
the tilde and simply use u to refer to the limiting reciprocal values for the discounted game. 

Turning towards the non-truncated infinite duration game, we now explore strategies cor- 
responding to the limiting values. For the Chooser, the transition probabilities are 

, = IV,1,(Y, II =.,1.Y, =,1 = ^ = * (48) 



while the Guesser has guessing probabilities 

qid =Prob(G t+1 =j\X t = i) = r l{ni = \ (49) 

(VPij - PPi.min) / (1 ~ WiPPi.mm) if Hi > 2 

and wagers 

{1 if m = 1 , 

R -f \o (50) 

1 - rHPPi,min if ru > 2 

when at node i. Here again /3 is arbitrary in (0, 1], and Pi, m in is the probability of the vertex 
least likely to be chosen by the Chooser: 

Pi.min = min{pij : j £ V, i — > j} 

We will refer to the strategies given by (|48p . (|49|l . and (|50p as the limiting strategies for the 
players. The next theorem gives the basic properties of the game under the limiting strategies. 

Theorem 7 (Limiting Strategies). Let Q = (V,£) be a strongly connected aperiodic graph. 
Let M be the propagation matrix for Q defined by (|43[). Let r be the maximal eigenvalue of 
M , and let v = (vi : i £ V) be the vector of limiting values. Then under the limiting strategies 
(1481) . (1491) . and (|50p the Chooser's transition probability matrix P is diagonally similar to the 
propagation matrix: 

P=~VMV~ 1 , (51) 
r 

where V — diag(v) is the diagonal matrix of limiting values. Under the limiting strategies, the 
Guesser 's fortune satisfies 

]imr t E[Ft\X =i] = cv i , (52) 

t— >oo 

where c is a positive constant that depends only on Q . 



Proof. Because the limiting reciprocal value vector u satisfies Mu = ru, equation (|48[) reduces 
to 

1 / 1 / 

r r 

for all i and j, and equation (|5ip follows. 

For (|52|) . start by considering a vertex i whose out-degree ru is at least 2. Then for any 

adjacent vertex j we have 

EIF^Xq = i,Xi =j] = (l + (n< - + (1 - Mi) (1 - qij) 

— Wi {niq itj — 1) + 1 

= 71, (Pi,j - /3pi,min) — (1 — TliPpi^min) + 1 

— I^i 

r «j ' 

Therefore 

Vi = E [F^Xo =i,Xi=j]r vj (53) 

In fact, (|53[) holds for vertices i of out-degree 1 as well. To see this, note that if i has out-degree 
1, and i — > j, then by equation ((43} and Mm = ru we see that 



But since i has out-degree 1, F\ —2 with probability 1, so in fact the last equation is equivalent 
to ((53)) . Thus, ((53)) holds for all vertices i and j such that i — > j. Now using the Markov 
property we have 

E [F t \X =t] = Y, E l F <\ x o =i,Xi = j] Pi ,j 
j 

= J2 E t Fl l^o =i,Xi= j] E [FU\X = j] p^ 

j 

= Y,--E[F t -AX = j] Pi>j 
j 

1 2 

7 J 



The matrix form of this equation is 

( 

where £t(i) = JS [-Ft|Xo = i]. Since £o = 1, we have 
and so by Lemma [3] 



= r- 2 V 2 MV- 2 Ct-i. 



C< = r"VM'r J l, (54) 



lim r'Ct = rV 2 M'F 2 l 

t — *oo 

= y 2 ^ V- 2 1. 

To see that the limit is a multiple of the limiting value vector v, denote G = r~ 2 V 2 MV~ 2 , 
so that 

Ct = G*l. (55) 
Note that G has the same pattern of zero and non-zero entries as M, and hence is irreducible 
and primitive. We claim that l/r is the maximal eigenvalue of G, and that the limiting value 
vector v is an associated eigenvector. To see this, first observe that 

Gv = r~ 2 V 2 MV~ 2 v 



so that 1/r is indeed an eigenvalue of G, with associated eigenvector v. To see that 1/r is the 
maximal eigenvalue of G, note that if (A, x) is any eigenpair for G, then Gx — Xx implies that 
M(V~ x) = r \(V~ x), so that r A is an eigenvalue of M, with associated eigenvector V~~ x. 
But r is the maximal eigenvalue of M, so we must have |r 2 Aj < r, which implies |A| < 1/r, 
which means that 1/r is the maximal eigenvalue of G. Now since G is primitive with maximal 
eigenvalue 1/r, it follows that r'G'l converges to some multiple of the associated eigenvector 
v. By (|55[) this means that 

lim r*E[Ft\Xo = i] = cvt. 

t — >oo 

This completes the proof. □ 

Because Q is strongly connected and aperiodic, the random walk on Q that arises from 
the limiting strategies is ergodic, and has a unique invariant measure p. Under this invariant 
measure, the discounted fortune process 

D t = r*F t (56) 

is stationary. Let [■] denote the expectation operator under the invariant measure (i.e. 
assuming that the initial position Xq of the players has distribution fi). The steady-state 
fortunes defined by 

m =E»[D t \X t =j), jev 

represent the Guesser's average discounted fortune when located at vertex j. By stationarity, 
they do not depend on t. The next theorem characterizes the invariant measure and steady- 
state fortunes. 

Theorem 8. Let Q be a strongly connected aperiodic graph with propagation matrix M and 
associated maximal eigenvalue r. Then the invariant measure /i for the position process 
(Xt : t > 0) from the limiting strategies is the entry-wise product of appropriately scaled right 
and left eigenvectors x and y respectively of M associated with r: 

H = -4~ (?iVi : i € V) . (57) 
x L y 

The steady state fortunes are of the form 

E,{D t \X t =j]=c^, (58) 

where c is a constant that depends on Q. 

Proof. The invariant measure /j, satisfies P T fi = fi. But P — r~ 1 VMV~ 1 , so the invariant 
measure must satisfy M T V fi = rV/i. By primitivity, V/i = cy, where y is the (unique) left 
eigenvector of M correponding to r. Thus 

fj, = cV~ 1 y = c (myi : i 6 V) j 

and (|57p follows because the reciprocal value vector u is a right eigenvector of M. 

For (I58[) . by conditioning on the position of the players at time t — 1, one sees that 

Vi 



E^ [D t \X t =j]=V] -E„ [D t \X t =i]p 



1 2 

£ --^rmjE^DtlXt =i] 

i:i — >i ^ 



The matrix form of this equation is 



rr, T = V T V 2 MV' 2 , 

where r\j = E^\Dt\Xt — j]. By the primitivity of M we have V 2 n = cy, and (|58|) now 
follows. □ 



Theorem 9 (Optimal play on strongly connected graphs). Let Q be a strongly connected 
aperiodic graph. Then the limiting strategies (|51l) . (I49|) . and (|50|) are optimal for the infinite- 
duration Path Guessing Game on Q . 



Proof Sketch. As in the proof of Theorem [5j we can restrict attention to purely positional 
strategies. For any choice of purely positional strategies by the Players, there will exist some 
discount factor d such that the limits 

lim cfE[Ft\Xo =i] = v* 

s — >oo 

exist and are non-zero. For this choice of strategies, the discounted game starting at node i is 
therefore equivalent to the game on the fan with root node i, whose leaves are the set of vetices 
j such that i — » j in G, and with vertex j having the value dv* . By the results of Section^ the 
players are playing optimally if and only if Mu* = du*, where u* is the vector of reciprocals 
of the entries of v* . But M is irreducible and primitive, so its only positive eigenvector is the 
reciprocal value vector u of Lemma [3] Thus u* = u, and it follows that the optimal strategies 
are the same as the limiting strategies. □ 

For what strongly connected aperiodic graphs Q is the Path Guessing Game fair, in the 
sense that the Guesser's expected fortune at the end of the game is equal to the $1 that she 
started out with? 

Theorem 10. Let Q be a strongly connected aperiodic graph. Then the Path Guessing Game 
on Q is fair if and only if each vertex of Q has out-degree at least two. 

Proof. If each vertex has out-degree two or more, then the propagation matrix M is stochastic, 
implying that r = 1 and u = 1, and it follows from (|54|l that the game is fair: E [F t \Xo = i] = 1 
for all i £ V. On the other hand, if there exists a vertex i of out-degree one, then M is strictly 
sub-stochastic, and in fact the i component of Ml equals 1/2. By primitivity there exists 
an integer to > such that M*° is strictly positive. It follows that that all the components 
of M' 0+1 l are strictly less than one, and hence limt^oo M'l = 0. Therefore the maximal 
eigenvalue r of M must be strictly less than one, and therefore the Guesser's expected fortune 
under optimal play grows without bound. □ 



6 The Lying Oracle Game 

In this section we apply the previous results to the Lying Oracle Game. As described in [T], 
this is a two-player game between an Oracle and a Bettor. The game is based on a sequence of 
n tosses of a fair coin. The Oracle has perfect foreknowledge of each toss, but may choose to 
lie to the Bettor in up to k of the n tosses. Before each toss, the Bettor privately writes down 
a guess as to whether she thinks the Oracle will lie or tell the truth on this toss. The Bettor 
also announces a wager on her guess. The Oracle then announces his (possibly mendacious) 
prediction. Finally the coin is then tossed, and the outcome is recorded. If the Bettor was 
correct, she wins the amount of her wager; if she was incorrect, she loses the amount of her 
wager. Play continues in this fashion until the n of tosses are completed. 

The Lying Oracle Game has been shown to have a reciprocal relationship with the contin- 
uous version of Ulam's search problem [3], which has been used as a model of binary search in 
the presence of errors [4|. In Ulam's search problem, a questioner is searching for a number 
in the interval [0, 1] chosen by a responder. The questioner asks the responder n questions 
about the number's location in the interval, but the responder may lie up to k times. The 
questioner seeks to find a subset of smallest measure that contains the chosen number, while 
the responder seeks to maximize the measure of that subset. Under optimal play, the measure 
of the questioner's subset is the reciprocal of the Bettor's fortune in the Lying Oracle Game. 

The Lying Oracle Game can be generalized in several ways. In [2] the authors consider 
the game when the coin is not fair. The "at most k lies in n tosses" rule can be generalized 
to a set of arbitrary "lying patterns" . Here, we make the point that the original Lying Oracle 
Game (with a fair coin) can be seen as a special case of the Path Guessing Game. Moreover, 
the results from the previous sections can be used to derive optimal play in the Lying Oracle 
Game when the number of coin tosses is infinite. 

Example 1. Consider the game Q n ,i in which the Oracle can lie at most one time in any 
block of n predictions. The corresponding graph is shown in Figure 1. 



Figure 1. The game Q n ,i- The Oracle can lie at most 1 time in any block of n tosses. Play 
starts at vertex 1. 



Note that since the out-degree of each vertex is two or less, our payoff rule |T]) coincides 
with the payoff rule for the Lying Oracle Game. The propagation matrix is M = hA, where 
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fl 1 
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1 




\1 
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0/ 



The maximal eigenvalue of M is r — ^X, where A = A(n) is the largest positive solution of 



= det (XI - A) = X n - A" 



- 1. 



The row-sums of A tell us that A £ [1,2], and it is not hard to show that as n approaches 
infinity A decreases to 1 monotonically. Thus r approaches 1/2, as we would expect. The right 
eigenvector of M corresponding to r is 

/ \ n — 1 i \ \2 \ n — 3 \n — 2\ 

35= (A ,1,A,A ,...,A ,A ) , 
and the left eigenvector corresponding to r is 

{ \n — 1 \7i — 2 \n— 3 \ i\ 

y = (X , A , A , . . . , A, 1) . 
The Oracle's optimal strategy at node 1 (the only non-trivial node) is 

\ — 1 \ — n 

pi,i = A , pi j2 = A . 

In other words, the Oracle tells the truth at node 1 with probability A , and lies with 
probability 1 — A" 1 = A _n . Note that as n increases without bound, the probability that the 
Oracle tells the truth when he is at node 1 approaches 1: 

lim = lim A -1 = 1 

n — >oo 71 — >oo 

The Bettor's minimum risk (/3 = 1) optimal strategy is 

qi,i = 1, gi,2 = 

* — 1 \ — 71 

= Wl,2 = A — A 

Under optimal play, the players perform a random walk through the graph, with invariant 
measure 

li= -(A",l,l,...,l) 

r A™ + n — 1 ' 

Observe that the fraction of time that the Oracle lies under optimal play is 

1 

X n + n — 1 

which increases to 1/n as n approaches infinity. Thus under optimal play the Oracle lies 
slightly less often than the rules allow. ■ 



We remark that if the Oracle's set of allowed "lying patterns" can be expressed in the form 
a finite set of finite forbidden patterns - sequences of truths and lies that the Oracle must avoid 
- then the Lying Oracle Game (with a fair coin) is equivalent to the Path Guessing Game on 
a certain finite graph. All games in which the Oracle can lie at most k times in any block of n 
statements fall into this class, and as such can be analyzed using the results of Section [5] Our 
approach also leads to new variants of the Lying Oracle Game in which the Oracle is allowed 
to end the game under certain conditions, rather than after a specified number of tosses. For 
instance, we can consider a game in which the Oracle can lie at most k times in any block of 
n tosses, and can stop the game after any toss on which he told the truth. In these games, 
the Guesser tries to predict whether the Oracle will lie, tell the truth, or stop the game. Such 
games correspond to a Path Guessing Game on a terminating graph. 

Example 2. Consider the game in which the Oracle can lie at most 1 time in any block of 
n tosses, and can stop the game on the first round, or after any round on which he told the 
truth. The graph for this game is shown in Figure 2. 




Figure 2. The Oracle can lie at most 1 time in any block of n tosses, and the Oracle can choose 
to stop after any toss on which he told the truth. Play starts at vertex 1. 



There are n nonterminal nodes and one terminal node. Using the results of Section |4l we 
find that under optimal play the probability pi, n +i that the Oracle stops the game from node 
i is given by 



Pi, 71+1 = < 



3 (2" 



if i = 1 

if i = 2 

if 3 < i < n 



It is easy to show that if n and i go to infinity concurrently such that i/n approaches x € (0, 1), 
then Pi, n +i approaches 1/2. We also have pi, n +i approaching 4/9 as n approaches infinity. 
Thus, even for large n, the game duration is likely to be very short. ■ 
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